Model selection techniques for the frequency analysis of hydrological extremes: the MSClaio2008 R function
نویسنده
چکیده
The frequency analysis of hydrological extremes requires fitting a probability distribution to the observed data to suitably represent the frequency of occurrence of rare events. The choice of the model to be used for statistical inference is often based on subjective criteria, or it is considered a matter of probabilistic hypotheses testing. In contrast, specific tools for model selection, like the well known Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC), are seldom used in hydrological applications. The paper of Laio et ̃al. (2008) verifies whether the AIC and BIC work correctly when they are applied for identifying the probability distribution of hydrological extremes, i.e. when the available samples are small and the parent distribution is highly asymmetric. An additional model selection criterion, based on the Anderson-Darling goodness-of-fit test statistic, is proposed, and the performances of the three methods are compared trough an extensive numerical analysis. In this brief document, an application of the R function MSClaio2008, part of the package nsRFA, is provided. Introduction The problem of model selection can be formalized as follows: a sample of n data, D = (x1, . . . , xn), arranged in ascending order is available, sampled from an unknown parent distribution f(x); Nm operating models, Mj , j = 1, . . . , Nm, are used to represent the data. The operating models are in the form of probability distributions, Mj = gj(x, θ̂), with parameters θ̂ estimated from the available data sample D. The scope of model selection is to identify the model Mopt which is better suited to represent the data, i.e. the model which is closer in some sense to the parent distribution f(x). Three different model selection criteria are considered here, namely, the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), and the Anderson-Darling Criterion (ADC). Of the three methods, the first two belong to the category of classical literature approaches, while the third derives from a heuristic interpretation of the results of a standard goodness-of-fit test (see Laio, 2004). The R function MSClaio2008, part of the package nsRFA, is used on a data sample from the FEH database: > data(FEH1000) The data of site number 69023 are used here: > sitedata serieplot(sitedata[,4], sitedata[,3], ylim=c(0,200), + xlab="year", ylab="Max annual peak [m3/s]")
منابع مشابه
A Robust SAR NLFM Waveform Selection Based on the Total Quality Assessment Techniques
Design, simulation and optimal selection of cosine-linear frequency modulation waveform (CNLFM) based on correlated ambiguity function (AF) method for the purpose of Synthetic Aperture Radar (SAR) is done in this article. The selected optimum CNLFM waveform in contribution with other waveforms are applied directly into a SAR image formation algorithm (IFA) and their quality effects performance ...
متن کاملBayesian Technique for the Selection of Probability Distributions for Frequency Analyses of Hydrometeorological Extremes
Frequency analysis of hydrometeorological extremes plays an important role in the design of hydraulic structures. A multitude of distributions have been employed for hydrological frequency analysis, and more than one distribution is often found to be adequate for frequency analysis. The current method for selecting the best fitted distributions are not so objective. Using different kinds of con...
متن کاملSite selection of water storage based on multi-criteria decision analysis
Water loss can be minimized and conserve through constructing small storage dams for various irrigation purposes to support local livelihood. Geographic information system provides powerful techniques for many hydrological modeling and suitable dam site selection. The current study explored potential sites for small storage dams to meet agricultural requirements in district Malakand, Khyber Puk...
متن کاملFlood Flow Frequency Model Selection Using L-moment Method in Arid and Semi Arid Regions of Iran
Statistical frequency analysis is the most common procedure for the analysis of flood data at a gauged location thatin first step it is needed to select a model to represent the population. Among them, the central moment has been themost common and widely used, and with the using of computers, the application of the maximum likelihood hasincreased. This research was carried out in order to reco...
متن کاملNovel Radial Basis Function Neural Networks based on Probabilistic Evolutionary and Gaussian Mixture Model for Satellites Optimum Selection
In this study, two novel learning algorithms have been applied on Radial Basis Function Neural Network (RBFNN) to approximate the functions with high non-linear order. The Probabilistic Evolutionary (PE) and Gaussian Mixture Model (GMM) techniques are proposed to significantly minimize the error functions. The main idea is concerning the various strategies to optimize the procedure of Gradient ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010